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^ ■ Abstract 

We study zero-temperature stability of topological phases of matter under weak time- 
On! independent perturbations. Our results apply to quantum spin Hamiltonians that can be written 
as a sum of geometrically local commuting projectors on a D-dimensional lattice with certain 
topological order conditions. Given such a Hamiltonian Hq we prove that there exists a con- 
stant threshold e > such that for any perturbation V representable as a sum of short-range 
bounded-norm interactions the perturbed Hamiltonian H = Hq + eV has well-defined spectral 
bands originating from O(l) smallest eigenvalues of Hq. These bands are separated from the 
rest of the spectrum and from each other by a constant gap. The band originating from the 
t^J- ' smallest eigenvalue of Hq has exponentially small width (as a function of the lattice size). 

Our proof exploits a discrete version of Hamiltonian flow equations, the theory of relatively 
bounded operators, and the Lieb-Robinson bound. 
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1 Introduction 



The traditional classification of different phases of matter due to Landau rests on symmetry 
breaking. Given a pair of gapped Hamiltonians Hi, H2 with some symmetry group G, the ground 
states of Hi and H2 were considered to be in different phases if their symmetry breaking patterns 
are different. The discovery of topologically ordered phases, however, changes this paradigm. 
Models such as Kitaev's toric code p] have "topologically non-trivial" ground states despite 
lacking any symmetry breaking. Such states cannot be changed into a "topologically trivial" 
state such as a product state by any unitary locality- preserving operator [2]. 

One possible approach to classifying topological phases is to call a pair of gapped Hamilto- 
nians Hi , H2 topologically equivalent iff it is possible to connect Hi and H2 by a continuous 
path in the space of local gapped Hamiltonians. Using the idea of quasi-adiabatic continua- 
tion [3], one can describe the evolution of the ground state subspace along such a path by a 
unitary locality-preserving operator. In particular ground state degeneracy and the geometry 
of "logical operators" acting on the ground subspace is the same for Hi and Hi- 

Most of the Hamiltonians describing TQO models such as Kitaev's quantum double model [1] 
or Levin- Wen string-net model [I] are not quite physical since they involve interactions affecting 
more than two spins at a time. One may hope however that such models emerge as low-energy 
effective Hamiltonians describing some simpler high-energy theories [51 El [7]. For example, the 
toric code model with four-spin interactions can be "implemented" as the fourth-order effective 
Hamiltonian describing low-energy limit of the honeycomb model [S] which involves only two- 
spin interactions. The higher-order corrections to the effective Hamiltonian must be regarded 
as a perturbation. Thus in order to show that the honeycomb model is topologically equivalent 
to the toric code (in the J z S> J x , Jy phase) one has to prove that the spectral gap in the toric 
code Hamiltonian does not close in a presence of weak perturbations V that can be represented 
as a sum of bounded-norm short-range (exponentially decaying) interactions. 

Even if one leaves aside the question of how multi-spin interactions can be implemented 
in a lab, one has to worry about precision up to which an ideal model Hamiltonian can be 
approximated in a real life. If the presence or absence of the gap depends on tiny variations of 
the Hamitonian parameters that are beyond experimentalist's control, the distinction between 
gapped and gapless Hamiltonians is meaningless. The best we can hope for is to approximate 
individual interactions of the ideal model with some constant precision e independent of the 
system size N. Accordingly, the ideal Hamiltonian can be approximated only up to an extensive 
error O(eN). Proving stability of topological phases thus reduces to proving that the spectral 
gap of the ideal TQO models does not close in the presence of such extensive perturbations. 

Currently, the tools for proving lower bounds on the spectral gap are fairly limited. For 
example, one of the outstanding problems in mathematical physics is to prove the existence of 
a spectral gap for the spin-1 Heisenberg chain, making rigorous the arguments of Haldane [9|. 
Some progress toward this was obtained by Yarotsky [10J, who showed the stability of the gap 
near the AKLT point [llj. Yarotsky's tools however are limited to perturbations of Hamiltonians 
which are topologically trivial. Thus, new methods are needed to analyze topologically ordered 
phases. Some partial results were recently obtained by Trebst et al [12] and Klich p3] who 
proved gap stability for the toric code under a special type of perturbations diagonal in the 
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z-basis as well as for anyon lattices on a sphere. 

In the present paper we succeed in proving gap stability under generic local perturbations. 
Our results are valid not just for the toric code, but more generally for any Hamiltonian which 
can be written as a sum of geometrically local commuting projectors on a D-dimensional lattice 
with certain topological order conditions that we define later. This includes models such as 
Kitaev's quantum double model pQ and the Levin- Wen string- net model flj. Furthermore, we 
prove stability of the spectral gaps separating sufficiently low-lying eigenvalues of the unper- 
turbed Hamiltonian. In the case of 2D models with anyonic excitations it allows us to define 
string-like operators that create particle excitations for the perturbed Hamiltonian and prove 
stability of invariants describing the braiding statistics of excitations. We explain how this may 
be used to adiabatically control a perturbed topological model to perform braiding operations 
to manipulate topologically protected quantum information. 

1.1 Summary of results 

Consider a system composed of finite-dimensional quantum particles (qudits) occupying sites 
of a D-dimensional lattice A of linear size L. The corresponding Hilbert space is a tensor 
product of the local Hilbert spaces, % = ^ ue ^~H u , dim (H u ) = 0(1). Suppose the unperturbed 
Hamiltonian Hq can be written as a sum of geometrically local pairwise commuting projectors, 

ACA 

where the sum runs over all subsets of the lattice of diameter O(l) and Qa is a projector acting 
non-trivially only on sites of A (one may have Qa = for some subsets A). The commutativity 
assumption implies that all projectors Qa can be diagonalized in the same basis. Accordingly, 
all eigenvalues of Hq are non-negative integers. We assume that the smallest eigenvalue of Hq 
is zero, that is, ground states of Hq are annihilated by every projector Qa- Such states span 
the ground subspace P, 

P = {\i>) en : Q A \i/>) = for all AC A}. 

For any subset B C A we shall also define a local ground subspace as 

P B = £-H : Q A \4>) = for all A C B}. 

We shall use the notations P and Pb both for linear subspaces and for the corresponding 
projectors. Note that the projector Pg acts non-trivially only on the subset B. 

We shall impose two extra conditions on Hq and the ground subspace P that guarantee the 
gap stability. Let us first state these conditions informally (see Section [2T2l for formal definitions): 

TQO-1: The ground subspace P is a quantum code with a macroscopic distance^, 
TQO-2: Local ground subspaces are consistent with the global one 

For our purposes it suffices that the distance grows as a positive power of the lattice size L 
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Condition TQO-1 is the traditional definition of TQO. It guarantees that a local operator cannot 
induce transitions between orthogonal ground states or distinguish a pair of orthogonal ground 
states from each other. Thus a local perturbation can lift the ground state degeneracy only in 
the n-th order of perturbation theory, where n can be made arbitrarily large by increasing the 
lattice size, see pp. Surprising, condition TQO-1 by itself is not sufficient for stability, see a 
simple counter-example in Section [2.4i 

Condition TQO-2 demands that a local ground subspace Pb and the global ground subspace 
P must be consistent, namely, the projectors Pb and P must have the same reductions on any 
subset A C B which is "sufficiently far" from the boundary of B. We need to impose TQO-2 
only for regions with trivial topology such a cube or a ball. The consistency between the global 
and the local ground subspaces may be violated for regions with non-trivial topology. For 
example, if B has a hole, the local ground subspace Pb may include sectors with a non-trivial 
topological charge inside the hole as opposed to the global ground subspace. Condition TQO-2 
by itself is also not sufficient for stability, see a counter-example in Section 12.41 

Let us emphasize that all our results apply also to the special case when Hq has non- 
degenerate ground state. In this case TQO-1 is automatically satisfied since P is a one- 
dimensional subspace and thus condition TQO-2 alone guarantees the gap stability. 

We consider a perturbation V that can be written as a sum of bounded-norm interactions 

y = E E k,a, 

r>\ AeS{r) 

where S{r) is a set of cubes of linear size r and V Ti a is an operator acting on sites of A. We 
assume that the magnitude of interactions decays exponentially for large r, 

max < Je _Atr , 

AeS(r) 

where J, [i > are some constants independent of L. Our main result is the following. 

Theorem 1. Suppose Hq obeys TQO-1, 2. Then there exist constants Jq,ci,C2 > depending 
only on [i and the spatial dimension D such that for all J < Jq the spectrum of Hq + V is 
contained (up to an overall energy shift) in the union of intervals Ufc>o^» where k runs over 
the spectrum of Hq and 

I k = {A € R : k(l - ci J) —6<\<k(l + ci J) + 5}, 

and 

5 = poly(L) exp (-c 2 L 3/8 ). 

In other words, the perturbation V changes positive eigenvalues of Hq at most by a constant 
factor 1 ± ci J (neglecting the exponentially small correction 5) while the smallest eigenvalue 
k = is transformed into a band Iq of exponentially small width 25, see Fig. [TJ One can easily 
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check that for any fixed k the band Ik is separated from all other bands I m , m ^ k, by a gap 
at least 1/2 provided that J < Jk, where 



ci(4fe + 2)' 

Thus the bands originating from eigenvalues 0, 1, . . . , k of Hq are separated from each other and 
from the rest of the spectrum by a gap at least 1/2 provided that J < Jk- 

In the case when excitations of Hq are anyons, one can infer all topological invariants such 
as S, R, and F-matrices by evaluating fusion and braiding diagrams with only a few particles 
(for example 4 particles suffice to compute all F matrices). Accordingly, any matrix element of, 
say, F-matrix, can be represented as an expectation value (ipo\O m ■ ■ • C^OilV'o) where \ipo) is the 
ground state and Oj are operators creating pairs of excitations from the ground state, moving, 
fusing, and annihilating them. Our stability result for excited states with O(l) excitations allows 
us to construct quasi-adiabatic continuation of operators Oj thus explicitly demonstrating that 
the perturbed Hamiltonian has the same S, R, and F matrices as the ideal one, see Section [7J 

Energy 




Figure 1: Energy bands Ik describing the spectrum of a perturbed Hamiltonian Hq + V. 



1.2 Sketch of the stability proof 

Let us sketch the main steps of the proof of Theorem [TJ We start from proving the theorem for 
a special class of perturbation V such that all individual interactions V Tj a preserve the ground 
subspace P, that is, [P, V Tt A\ = 0. We call such perturbations block- diagonal. In Section [3] we 
prove that block-diagonal perturbations are relatively boundedby Hq, that is, \\Vijj\\ < 6||flo^|| 
for any state \ip) G H and for some coefficient b = O(J). Here for simplicity we ignore some 
exponentially small corrections. A nice feature of relatively bounded perturbations is that the 
spectrum of a perturbed Hamiltonian Hq + V is contained in the union of intervals Ik where k 
runs over the spectrum of Hq and Ik = (k(l — b), k(l + 6)), see Section I3TT1 The proof of the 
relative boundness is rather elementary and uses certain decomposition of the Hilbert space in 
terms of syndrome subspaces which is a standard tool in the theory of quantum error correcting 
codes. In order to get a strong enough bound on the coefficient b we use a novel technique of 
"coarse-graining" the syndrome subspaces, see Section [3.21 for details. 



6 



In the second part of the proof we reduce generic perturbations V to block-diagonal pertur- 
bations. Specifically, we construct a unitary operator U such that U(Hq + V)W ~ Hq + W, 
where W is a block-diagonal perturbation. Since U does not change eigenvalues, we can use 
the techniques described above to analyze the spectrum of Hq + V. The operator U is con- 
structed using a discrete version of Hamiltonian flow equations developed by Glazek, Wilson, 
and Wegner [T3]. Specifically, we define a hierarchy of Hamiltonians H{n) = Hq + V(n) + W(n) 
labeled by an integer level n > 0, such that W(n) is a block-diagonal perturbation while V{n) 
is a generic perturbation. We start at the level n = with the perturbed Hq + V, that is, 
V(0) = V and W(0) = 0. As we go to higher levels, the Hamiltonian H(n) becomes more close 
to a block-diagonal form. The transformation from H{n) to H(n + 1) is described by a unitary 
operator U{n) that block-diagonalizes H(n) up to errors of order V(n) 2 . These errors are dealt 
with at the next level of the hierarchy, see Section H] for details. We construct U(n) by solving a 
linearized block-diagonalization problem, see Section The solution can be easily constructed 
in terms of the series while convergence of the series follows from the fact that Win) is relatively 
bounded by Hq. 

We prove that the strength of V(n) decays doubly-exponentially as a function of n, while 
W(n) does not change essentially after the first few levels. We then choose the desired unitary 
operator U as U = U(nj) ■ ■ ■ U(1)U(0) where the highest level nf ~ log (L) is chosen to make 
the norm of V(nf) exponentially small (as a function of L). The most technical part of the proof 
is to show that the unitary operators U{n) are locality preserving such that all Hamiltonians 
V(n) and W(n) remain sufficiently local. To this end we first prove that U(n) can be generated 
by a quasi-local Hamiltonian, see Section [5l and then employ the Lieb-Robinson bound, see 
Section [6l 

2 Hamiltonians describing TQO 

2.1 Frustration-free commuting Hamiltonians 

To simplify notations we shall restrict ourselves to the spatial dimension D = 2. A generalization 
to an arbitrary D is straightforward. Let A = x be a two-dimensional square lattice of 
linear size L with periodic boundary conditions. We assume that every site u £ A is occupied 
by a finite-dimensional quantum particle (qudit) such that the Hilbert space describing A is a 
tensor product 

H = (g)H u , dim?4 = 0(l). 

Let S(r) be a set of all square blocks A C A of size r x r, where r is a positive integer. Note that 
S(r) contains L 2 translations of some elementary square of size r x r for all r < L, S(L) = A, 
and S(r) = for r > L. We can always assume that the unperturbed Hamiltonian Hq involves 
only 2x2 interactions (otherwise consider a coarse-grained lattice): 

Hq= Y, 

A€S(2) 



7 



There will be three essential restrictions on the form of interactions Ga- Firstly, we require that 
Ga are pairwise commuting operators, that is, 

G a Gb = G b Ga for all A, Be 5(2). 

Thus all interactions Ga can be diagonalized in the same basis. Secondly, we require that Hq 
is a frustration free Hamiltonian, that is, the ground state of Hq minimizes energy of every 
individual term Ga- Performing an overall energy shift we can always assume that all Ga are 
positive-semidefinite operators, 

G A >0. 

Then the condition of being frustration-free demands that ground states of Hq are common zero 
eigenvectors of every term Ga- Thus the ground subspace of Hq is 

P = {\i>) eU : G A \il>) =0 for all A G 5(2)}. (2.2) 

Thirdly, we shall assume that every operator Ga has a constant spectral gap, that is, the smallest 
positive eigenvalue of G A is bounded from below by a constant independent of the lattice size 
L. We can always normalize the Hamiltonian Hq such that the spectral gap of any Ga is at 
least 1. This is equivalent to a condition 

G\ > Ga- 

Let Pa be the projector onto the zero subspace of G A and Qa = I — Pa- Note that all the 
projectors Pa,Qa are pairwise commuting. For any square B G S(r), r > 2 define a projector 
onto the local ground subspace 

Pb= 11 Pa (2.3) 

AeS(2) 
ACB 

and Qb = I — Pb- Note that Pb and Qb have support on B. We shall often use the same 
notation for a subspace and for the corresponding projector. 

2.2 Formal definition of TQO 

We shall need two extra property of Hq and the ground subspace P that guarantee the gap 
stability and robustness of the ground state degeneracy. We shall assume that there exists 
a constant c > such that the following conditions hold for some integer L* > cL for all 
sufficiently large L: 
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TQO-1: Let A £ S(r) be any square of size r < L*. Let Oa be any operator acting on A. 
Then 

PO A P = cP 

for some complex number c. 

TQO-2: Let A G <S(r) be any square of size r < L* and let -B G <S(r + 2) be the square 
that contains ^4 and all nearest neighbors of A. Define reduced density matrices 
PA = TV J 4c(P) and p A = Tta c {Pb)- Then the kernel of pa coincides with the 
kernel of pf . 



Remark 1. Using the language of quantum error correcting codes one can define the minimum 
distance of P as the smallest integer d such that erasure of any subset of d particles can be 
corrected for any encoded state \ip) G P, see [15] for details. Note that TQO-1 holds for 
L* = [Vd\ since the reduced state of any square A G S(L*) does not depend on the encoded 
state. What is less trivial, TQO-1 holds also for L* = Q(d), see [15]. Thus L* coincides with 
the distance of the code P up to a constant coefficient (as far as condition TQO-1 is concerned). 

Remark 2. Condition TQO-2 can be easily 'proved' if the excitations of H§ are anyons 
(since the latter assumption lacks a rigorous formulation, the argument given below is not 
completely rigorous either). Indeed, in this case we can choose a complete basis of the excited 
subspace Q such that the basis vectors correspond to various configurations of anyons. For 
non-abelian theories one may have several basis vectors for a fixed configuration of anyons that 
describe different fusion channels, see [8|. Note that any state G Pb is a superposition 
of configurations with no anyons inside B. Since A is a topological trivial region, any such 
configuration can be prepared from the vacuum P by some unitary operator Ua c acting on 
complementary region A c = A\A. Thus \ip) = Ua^o) for some ground state |V>o) £ P- Since 
all ground states |^o) have the same reduced matrix on A, it means that \ip) and P have the 
same reduced matrix on A. This implies TQO-2. The above arguments suggest that TQO-2 
holds for all 2D models of TQO that can be described by commuting frustration-free such as 
quantum double models [1] and Levin- Wen string-net models [3]. 

Remark 3. As was already mentioned, the consistency between the global and the local 
ground subspaces may be violated for regions with non-trivial topology. For example, if A has 
a hole, the local ground subspace Pa may include sectors with a non-trivial topological charge 
inside the hole as opposed to the global ground subspace. 

We shall need the following corollary of TQO-2. 

Corollary 2.1. Let A G <S(r) be any square of size r < L* and Oa be any operator acting on 
A such that OaP = 0. Let B G S(r + 2) be the square that contains A and all nearest neighbors 
of A. Then O a Pb = 0. 

Proof. Let pa = Tta°{P)- The assumption OaPO a = implies that OaPa O a = 0, that is, Oa 
annihilates any state in the range of pa- From TQO-2, the range of Tr a c (Pb) coincides with 
the range of pa, and thus Tt(OaPbO a ) = 0. It implies OaPb = 0. □ 
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2.3 Verification of TQO conditions for stabilizer Hamiltonians 



Conditions TQO- 1,2 can be easily checked for those models of TQO that can be described using 
the stabilizer formalism such as the toric code model [lj or topological color codes [16J. For 
such models each site of the lattice A represents one or several qubits, while the ground state 
subspace P is a stabilizer code, i.e., the invariant subspace of some abelian stabilizer group 
Q C Pauli(A). Here Pauli(A) is a group generated by single-qubit Pauli operators af,af,af. 
The stabilizer group must have a set of geometrically local generators, that is, Q = (Si, . . . , Sm) 
where any generator S a £ Pauli(A) acts non-trivially only on O(l) qubits located within distance 
0(1) from each other. Note that the generators need not to be independent. We choose the 
corresponding stabilizer Hamiltonian Hq as 

H = J2(I-Sa)/2 

a 

such that states invariant under action of stabilizers have zero energy. The minimal distance of 
the code is the smallest integer d such that there exists a Pauli operator O that commutes with 
all elements of Q but does not belong to Q. Such an operator O can be regarded as a logical 
Pauli operator acting on encoded states. It follows from results of [15] that condition TQO-1 
holds if we choose L* = Cl(d). 

Assume that the set of qubits is coarse-grained into sites of the lattice A such that the 
support of any generator S a is contained in at least one 2x2 square. One can bring this 
Hamiltonian into the form Eq. ()2. lj) by distributing the generators over 2x2 squares in an 
arbitrary way. For any square B € S(r) one can define two subgroups of Q: (i) a subgroup Qb 
generated by generators S a whose support is contained in B, and (ii) a subgroup Q(B) that 
includes all stabilizers S E Q whose support is contained in B. By definition, Qb Q Q(B), but 
in general Qb / G{B). 

Lemma 2.1. The stabilizer Hamiltonian Hq satisfies condition TQO-2 iff for any square A £ 
S(r), r < L* , one has Q(A) C Q B , where B = h(A). 

Thus TQO-2 demands that any element of the stabilizer group whose support is contained 
in a square A can be written as a product of generators whose support is contained in A and a 
small neighborhood of A. We leave verification of this condition for the toric code model as an 
exercise for the reader. 

Proof. Indeed, the reduced density matrix pa computed using the global ground subspace P is 
proportional to the projector onto the codespace of the stabilizer code Q(A). The reduced density 
matrix pa computed using the local ground subspace Pb is proportional onto the codespace of 
the stabilizer code Qb(A), where Qb(A) includes all elements of Qb whose support is contained 
in A. Thus TQO-2 holds iff Q(A) = Qb{A). This is equivalent to the condition of the lemma. □ 
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2.4 Unstable version of the toric code model 



In this section we demonstrate that condition TQO-1 alone is not sufficient for stability. Let us 
start from the standard toric code model [1], 

Htc = - y] Bp - a s , 

p s 

where qubits live on edges of a square 2D lattice, p and s labels plaquettes and sites of the 
lattice, Bp is a product of a z over the four boundary edges of p, and A s is a product of four a x 
over the four edges incident to s. We shall refer to B p and A s as plaquette and star operators. 
The ground subspace P is defined by eigenvalue equations B p = 1 for all p and A s = 1 for all 
s. It is well known that P is a quantum code with the minimal distance d = L — 1. Hence P 
obeys TQO-1, 2 with L* = L - 1. 

Consider now a modified toric code model 

H'tc = - y~] B P B q - As ~ B P* 

(p,q) s 

where (p, q) labels pairs of adjacent plaquettes and p* is some selected plaquette. We assume 
that the total number of plaquettes N p is even. Note that H' tc is a frustration-free commuting 
Hamiltonian. In addition, H' tc and Ht c have the same ground subspace P corresponding to 
Bp = 1, A s = 1 for all s and p. Hence H' tc obeys TQO-1. We claim that H' tc violates TQO-2. 
Indeed, choose any square A located sufficiently far from the selected plaquette p*. Then the 
local ground subspace Pa has equal contributions from sectors B p = 1, A s = 1 and B p = — 1, 
A s = 1 thus being inconsistent with the global ground state. 

Let us now argue that the spectral gap of H' tc closes in a presence of local perturbations with 
a strength of order 1/N p . This instability has the same origin as the instability of the classical 
2D Ising model under external magnetic field. Note that H' tc has spectral gap A = 2 and the 
second smallest eigenvalue belongs to the sector A s = 1, B p = — 1 for all s and p. Consider a 
perturbation describing an "external magnetic field" , 

V = h Bp, h>0. 

v 

For sufficiently large h, say, h = 4:/N p , the ground state of H' tc + V moves from the sector 
A s = 1, Bp = 1 to the sector A s = 1, B p = — 1. Hence the gap above the ground state closes for 
some intermediate value of h. 

Needless to say, condition TQO-2 alone is also not sufficient for stability. The simplest 
counter-example is the 2D classical Ising model in which the gap is unstable under external 
magnetic field. 



11 



3 Relatively bounded perturbations 
3.1 Definition and basic properties 

In this section we introduce necessary facts from the theory of relatively bounded perturbations. 
It mostly follows Chapter IV of |17] although our definitions and proofs are much simpler since 
we are interested only in finite-dimensional Hilbert spaces. 

Let Hq and W be any Hamiltonians acting on some Hilbert space %. We shall say that W 
is relatively bounded by Hq iff there exist < b < 1 such that 

HW^II < b \\H ip\\ for all |V?) E U. (3.1) 

The notion of a relatively bounded perturbation allows one to define a "weak perturbation" 
and rigorously justify application of perturbative expansions even when the norm of W is much 
larger than the spectral gap of Hq. We shall be mostly interested in the case when b is a constant 
independent of the lattice size L. Note that the condition Eq. (13. If) is equivalent to an operator 
inequality W 2 < b 2 HQ. 

The following lemma asserts that a relatively bounded perturbation can change eigenvalues 
of Hq at most by a factor 1 ± b. 

Lemma 3.1. Suppose W is relatively bounded by Hq. Then the spectrum of Hq + W is contained 
in the union of intervals [Ao(l — b), Ao(l + b)] where Xq runs over the spectrum of Hq. 

Proof. Indeed, suppose (Hq + W) = A that is, 

(Hq-XI)\^) = -W\iP). (3.2) 
The relative boundness then implies [|(flo — AI)^|| < ^H-Ho^H, that is, 

WiHo- Xlfm <b 2 (i(;\HlW)- (3-3) 

Let Hq = ^2x ^oP\ be the spectral decomposition of Hq. Here the sum runs over the spectrum 
of Hq and Pa is a projector onto the eigenspace with an eigenvalue Ao- Define a probability 
distribution p(Ao) = {ip\Px Q \ip} ■ Substituting it into Eq. (13. 3ft one gets 

^(A - A) 2 p(A ) <Y^b 2 X 2 P (X ). (3.4) 

Ao Ao 

Therefore there exists at least one eigenvalue Ao such that 

(Ao - A) 2 < b 2 X 2 . (3.5) 
This is equivalent to A (l - b) < X < A (l + b). □ 
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3.2 Stability of TQO under block-diagonal perturbations 

In this section we shall consider perturbations 

W = ^ W A 

AeS(q) 

such that all local terms Wa are block-diagonal, 

[W A ,P}=0 for all A £ S(q). 

We shall assume that q < L*, so condition TQO-1 implies that the restriction of Wa onto the 
P-subspace is a multiple of the identity. Since we are not interested in the overall shift in energy, 
we can assume that 

W A P = for all A G S{q). (3.6) 
The interaction strength of W will be measured by a parameter 

w= max \\W A \\. (3.7) 

AeS(q) 

Lemma 3.2. Let W be a perturbation satisfying Eqs. h3. 6\3.1\) . Then W is relatively bounded 
by Hq with a constant 

b = 0(wq 2 ). 

Proof. Let us start from introducing some notations. A syndrome s : 5(2) — > {0, 1} is a function 
that assigns an eigenvalue sa £ {0,1} to every projector Qa, A £ 5(2). Given a syndrome s 
and a square A £ 5(2) we shall say that A is a defect iff sa = 1- Thus one can consider s as a 
configuration of defects. For any syndrome s define a projector 

Rs= J] [*aQa + (1-8 A )(I-Qa)] 

AeS{2) 

projecting onto a subspace spanned by states with a syndrome s. Clearly the family of projectors 
R s defines an orthogonal decomposition of the Hilbert space, that is, ^2 S R s = 1. 

Let us fix some partition of the lattice into contiguous q x q squares Si, ... , Bm £ <S{q) (if 
L is not a multiple of q, the squares E>i may have size q ± 0(1)). We shall refer to a set of 2 x 2 
squares contained in a particular square Bj as a box. We shall need the following properties: 

1. Every square A £ 5(2) is contained in exactly one box Bi, 

2. Each box Bi overlaps with 0(q 2 ) squares C £ 5(g), 

3. Each square C £ S(q) overlaps with O(l) boxes Bi. 

Given a syndrome s and a box Bi we shall say that Bi is occupied if B{ contains at least one 
defect, that is, there is a 2 x 2 square A C B such that sa = 1- Otherwise we shall say that the 
box Bi is empty. 
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Given a syndrome s let b(s) C [M] be the subset of occupied boxes. For any subset of boxes 
3^ C [M] define a projector 

Ry = R s . 

s:b(s)=y 

It projects onto the subspace in which all boxes in y are occupied and the remaining boxes 
are empty. Clearly, the family of projectors Ry defines an orthogonal decomposition, that is, 
Ylyc\M] Ry = I" We- claim that any operator Wa acting on a square A 6 S{q) and satisfying 
Eq. (|3.6p has only a few off-diagonal blocks with respect to this decomposition. Specifically, 
Corollary 12.11 implies that 

RyW A Rz ± (3.8) 

only if A has distance 0(1) from some occupied box in y and A has distance 0(1) from some 
occupied box in Z, and the configurations y,Z differ only at those boxes that overlap with 
A. Clearly, for any fixed y C [M] such that y has k occupied boxes the number of pairs 
(A £ S(q),Z C [M]) that could satisfy Eq. (|3.8j) is at most 0(kq 2 ). Thus for any state \ip) we 
get 

(iP\W 2 \iP) = (^\RyWR z WR v \iP) 
y,z,vc[M] 

< E • WRzWRvW ■ \\Ry^\\ ■ \\RvM\ 

y,z,vc[M] 

< E \\RyWR 2 \\-\\RzWR v \\-±({il>\RyW) + {mv\1>)) 
y,zyc[M] 

J2 \\RyWR z \\-\\RzWR v \\-(ip\Ry\4>) 
y,z,vc[M] 

< E E 0(k 2 qWmRy\^)=0{w 2 q i )^\G\^), (3.9) 



k>o y-.\y\=k 



where 



G = E E k2R y- ( 3 - 10 ) 

k>o y-.\y\=k 

The inequality Eq. (|3.9p follows from the fact that y and Z differ at at most O(l) boxes and 
an obvious bound k(k + 0(1)) = 0(k 2 ). Finally, note that G < Hq since any configuration of 
defects with k occupied boxes must have at least k defects and since creating a defect costs at 
least a unit of energy. We arrive at 

(ij\W 2 \ij)<b 2 (i;\H 2 ^), b = 0(wq 2 ). (3.11) 

It completes the proof. □ 

We shall also need a local version of Lemma l3.21 For any region OCA define a local version 
of the Hamiltonian Hq, 

H (C)= °a- (3-12) 

AeS(2) 
ACC 
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Let Pc and Qc be the local versions of the projectors P and Q defined in Eq. (|2.3|) . The 
following is a straightforward corollary of Lemma 13.21 

Corollary 3.1. Let C C A be any square of size smaller than L* . Let W = YlAeS(q) be a 
perturbation satisfying Eqs. 113.6^3. 7| ). Suppose also that Wa = unless C contains both A and 
the nearest neighbors of A. Then W is relatively bounded by Hq(C) with a constant b = 0(wq 2 ). 

Proof. Combining Eq. (|3.6p and TQO-2 we conclude that WaPc = f° r a U A. Thus we can 
apply all steps in the proof of Lemma [3. 21 to the square C considered as the entire lattice A. □ 

We shall need another technical lemma that provides a bound on the norm of a commutator 
involving a block-diagonal Hamiltonian. Let us start from the simplest scenario. Let W be any 
operator such that W is relatively bounded by Hq with a constant < b < 1. Then for any 
operator S we have 

\\Q[S,W]P\\ = \\QWSP\\ = \\QWH- 1 QH SP\\ 

= \\QWH l Q[H G ,S]P\\ < ll^ffo" 1 ^!! • \\Q[H ,S]P\\. 

Here the first equality follows from WP = and the third equality uses HqP = 0. Let £ Q be 
a normalized state such that || VF-ff^QH = HVF-ff^ 1 ^!!- Using the relative boundness assumption 
we get 

WWH^QW = H^fl^Vll < b\\H H^\\ = b. 
To conclude, we have proved that 

\\Q[S,W]P\\<b\\Q[S,H ]P\\. 

Applying the same arguments as above with P and Q replaced by their local versions Pc and 
Qc, see Eq. (j2.3j) . we arrive at the following lemma. 

Lemma 3.3. Let C C A be any region. Let W be any Hamiltonian such that W is relatively 
bounded by Hq(C) with a constant < b < 1. Then for any operator S one has 

\\Qc[S,W]P c \\ <b\\Qc[S,H (C)}P c \\. (3.13) 

Combining this lemma and Corollary 13. II we get a simple upper bound on HQ^fS 1 , W]Pc||- 

Corollary 3.2. Let C C A be any square of size smaller than L* . Let W = Y^A&S(q) ^A be a 
perturbation satisfying Eqs. \3.6\3. 7| ). Suppose also that Wa = unless C contains both A and 
the nearest neighbors of A. Then for any operator S one has 

\\Qc[S,W]P c \\ < 0(wq 2 ) \\Qc[S,H (C)]P c \\. (3.14) 



15 



4 Hamiltonian flow equations 



4.1 Outline of the method 

We are interested in the low-energy spectrum of a perturbed Hamiltonian Hq + V, where V is 
a local perturbation specified by a list of local interactions, 

Here V t% a is some interaction supported on a square A G <S(r). Our strategy will be to reduce the 
case of a generic perturbation to the special case of a block-diagonal perturbation which we can 
analyze using techniques of Section [3l To this end we shall define a hierarchy of Hamiltonians 
unitarily equivalent to H , 

n 

H(n)=H + Y,W{k) + V(n) + E{n) + X(n)I, n = 0,l,2,..., (4.2) 
k=l 

such that H(0) = Hq + V and H(n + 1) can be obtained from H{n) by a unitary transformation, 

£T(n + 1) = U{n)H{n)U{n)\ U{n)U{n)^ = I. (4.3) 

Accordingly, the spectrum of H(n) is the same for all n > 0. The purpose of the transformation 
U(n) is to make the Hamiltonian more close to the block-diagonal form. We shall refer to H(n) 
as a level-n Hamiltonian. 

Let us describe the purpose of various terms in Eq. (|4.2p . The Hamiltonian W(k) represents 
a block-diagonal contribution to the total Hamiltonian H(n) that has been created at the level 
k. The Hamiltonian V(n) represents the part of the total Hamiltonian H(n) that has to be 
block-diagonalized at the level n + 1. The Hamiltonians V(n) and W(n) will be represented by 
a sum of local interactions supported in squares of size r < L* , and such that all interactions 
involved in W(n) are individually block-diagonal, 

V (n)= £ £ V r , A (n), 

l<r<L* AeS(r) 

and 

W{n)= £ £ W r , A (n), where QW r , A {n)P = 0. 

l<r<L* AG5(r) 

All contributions from squares of size r > L* will be collected into the third Hamiltonian E(n) 
which can be regarded as an error Hamiltonian. The norm of E{n) will be exponentially small in 
L for all n. The norms of Win) and V{n) will decay roughly as doubly-exponential functions of 
n. Thus at level n ~ logL the total Hamiltonian H(n) will be block-diagonal up to corrections 
of order exp (-poly(L)) resulting from V(n) and E{n). Finally, \{n) is an overall energy shift 
that we shall often ignore. 



16 



We start at the level n = with initial conditions 

W rA (0) = 0, V rA {Q) = V r , A for r < L*, E(0) = ^ V ^ A ~ 

r>L* AeS(r) 

Accordingly, H(0) = Hq + V is the Hamiltonian we are interested in. Suppose we have already 
defined the Hamiltonians W(0), . . . , W(n), V = V(n), E = E(n) for some level n. Let 

n 

W = J^W(fc) 

k=l 

be the overall block-diagonal part of H(n). Let 

H = H(n) = H + W + V + E. 

We shall define the operator U(n) in Eq. (j4.3p as U(n) = exp (S), where S is the solution of a 
linearized block-diagonalization problem 

Q([S, H + W] + V)P = 0, 5 f = -S. (4.4) 

The meaning of this equation can be easily understood if one treats V as a perturbation and 
Ho + W as an unperturbed Hamiltonian. Expanding the transformed Hamiltonian e s He~ s in 
powers of S we get 

e s He~ s = H + W + ([S, H + W] + V) + 0(5 2 ) + O(SV) + O(S). 

Thus Eq. (|4.4|) says that the transformed Hamiltonian must be block-diagonal up to terms 
0(F 2 ) and 0(E). The solution of Eq. (14. 4h is constructed in Section 03, see Lemma 15.11 We 
start from defining raw versions of W(n + 1) and V(n + 1) which we shall denote W and V 
respectively, namely 

W =[S,H + W] + V, (4.5) 

and 

V = e s (H + W + V)e' s - (H + W + V + [S, H + W\). (4.6) 
A simple algebra shows that 

e s He~ s = Ho + W + W + V + e s Ee~ s . 

Note also that W is block-diagonal due to Eq. (|4.4j) . We shall construct a decomposition of 
W into a sum of local interactions that are individually block-diagonal using techniques of 
Section [51 see Corollary 15.11 of Lemma 15.11 It will yield 

W = Y Yl W r ,A> where QW r>A P = 0. 
r>l AgS(t) 

We shall construct a decomposition of V into a sum of local interactions using Lemma 14.11 and 
Lemma 16 . 1 1 arriving at 

r>l AeS(r) 
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Next we use W and V to define W{n + 1) and V(n + 1) by taking out all contributions from 
squares of size r > L* and adding these contributions to the error Hamiltonian, that is, 

W(n + 1)= J2 E W"»vl, 

l<r<L* AeS(r) 

V(n + 1)= J2 E ^,A, 

l<r<L* AeS(r) 

and 

S(n + 1) = e 5 £e" 5 + ^ ^ + K,A- 

r>L* AeS{r) 

For the detailed analysis of these flow equations see Section 14.31 

4.2 Local decompositions of Hamiltonians 

In order to analyze convergence of the flow equations we shall need to set up some notations 
and terminology. Recall that S(r) is a set of all r x r squares. 

Definition 4.1. Let V be any operator acting on %. A local decomposition of V is a list of 
operators {V r a}t A where r > 1 and A G S{r) such that V T) a has support on a square A and 

v = Yl E v ^ ( 4 - 7 ) 

r>l Ae5(r) 

Note that a local decomposition of an operator is not unique since the squares involved in 
the decomposition overlap with each other. Nevertheless, we shall often identify an operator 
and its local decomposition unless it may lead to confusions. 

Definition 4.2. A local decomposition of an operator V is (J, /i, a) -decaying iff 

max max \\V rA \\r a e^ < J. (4.8) 

r>l AeS(r) ' 

Here we mean that J, fi, a are some constants independent of L. We shall often use a term 
(J, /i, a)-decaying operator meaning that this operator has a local decomposition which is 
(J, n, a)-decaying. To simplify notations we shall often use an abbreviation (J, /i)-decay for 
(J, fi, 0)-decay. 

In the rest of this section we derive several auxiliary technical results that can be skipped 
at the first reading. 

We shall often need to construct a local decomposition for a commutator [S, V] given the 
local decompositions of S and V. 

Lemma 4.1. Suppose S is (K, fj,, a) -decaying and V is (J, /i, (3) -decaying for some a, (3 > 4. 
Then [S, V] has a local decomposition which is (cK J, fi)- decaying for some constant c. 
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Proof. Consider local decompositions of S and V, 

S = EE S p> a > \\S pA \\<Kp- a e-w, (4.9) 
P>1 AeS(p) 

v = Yl E y ^ ii^,b|| < (4.10) 

If A G <S(p) and B € 5(g) is a pair of non-overlapping squares, A n 2? / 0, then clearly AU5 
can be covered by some square C 6 <S(p + g). Thus we can choose a local decomposition of 
[5, V] as 

[ 5 >^ = E E D r,C (4-H) 
r>2 C€5(r) 

where 

°r,c= E E E (4.i2) 

P+g=r AeS(p) BeS(q) 
ACC BCC 

where x(A,B,C) = 0, 1 is some function that 'distributes' the commutators [S Pi A,Vq,B] over 
different terms D r c- A specific form of this function is not important for us. For fixed p, q such 
that p + q = r and a fixed C £ S(r) we can bound the number of squares A, B C C as 

#{A€S(p) : ACC} = (r-p) 2 = q 2 

and 

#{BeS(q) : BCC} = (r-q) 2 =p 2 . 

It yields 

\\D r ,c\\ < 2KJe-^ r P 2 ~ a q 2 ^ < 2KJe-^ r p- 2 q- 2 = cKJe'^ (4.13) 

p+q=r P,q>l 

for some constant c provided that a, /3 > 4. □ 

We shall also need the following simple lemma that will allow us to amplify the degree a 
by any constant by "borrowing" some decay from the exponential function. It makes the decay 
rate \i a bit smaller and the amplitude J a bit larger. 

Lemma 4.2 (Degree Reset). Suppose V is (J, /j,, (3) -decaying. Let < e < 1 and a > be 

any constants. Then V is also (</', fi', a + j3) -decaying where 

J' = CJ 1 ' 6 and n' = n - Jl (4.14) 
Here c is a constant depending on e and a. 

Proof. Indeed, let // = \i — S where 8 will be chosen later. Then 

\\V rA \\ < Jr-Pe-"* < Jr- a - /3 e-^ r maxq a e^ 5q < c6~ a Jr'^e-^ , (4.15) 

<j>0 

where c = max x >o x a e~ x = O(l) is a constant. Choosing 5 = J e l a we achieve the desired 
scaling. □ 
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Finally, we shall use the following trivial observation. 

Lemma 4.3. Suppose V is (J, ji, a) -decaying. Then V also has a local decomposition which is 
(cJe 2>1 , fi, a) -decaying with an extra property that V T: a acts trivially on all sites that are adjacent 
to the boundary of A. Here c = 0(1) is some constant. 

Proof. Indeed, replace each square A £ S{r) in the decomposition of V by a square A' £ 
S{r + 2) that contains A and all nearest neighbors of A. Let V^ +2 A , = V Tj a- Then V = 
J2r>i Y1agS(t) A 1S the desired decomposition. □ 

4.3 Proof of the main theorem 

In this section we prove Theorem [TJ 

Proof. We shall derive simplified flow equations for a triple of parameters J(n), Jd(n), and //(n) 
such that V(n) is ( J(n), ^(n), /3)-decaying and W(n) is ( Jd(n), fi(n), a)-decaying for all n > 0. 
Here a, f3 are sufficiently large constants that will be chosen later. Our manipulations with 
local decompositions will typically decrease the constants a and /3, so after each step of the flow 
equations we shall need to reset these constants back to their original values using Lemma 14.21 
Since W(0) = we can choose initial conditions 

J(0) = J, J d (0) = 0, and /z(0) = \i. (4.16) 

Let us prove that for any constant < e < 1 there exist constants ci, C2> C3 > such that for all 
k > one has 

J{k + 1) < Cl J{k) 2 ^- e \ (4.17) 
Jd(k + l) < c 2 J(k) 1 - t , (4.18) 

/M(k + 1) = ^(k)-c 3 J{k + l)JZ, (4.19) 

\\E(k + l)\\ < \\E(k)\\ + 0{L 3 )J(k)e~ C3L ^ k) (4.20) 

provided that J(0) is below some constant threshold value. Note that although e can be chosen 
arbitrarily close to 0, Eq. (|4.19p does not permit one to choose e = since otherwise the decay 
rate n{n) becomes negative after O(l) iterations. 

Supposed we have already proved Eqs. (|4.17H4.20l) for k = 0,1, ... ,n. Let us denote V = 
V{n), E = E{n), 

n 

w = j2 w ( k ) 

k=0 

and 

n 

Jd = Y. J ^)- 

k=0 
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Since n(k) is monotone decreasing for k = 0, . . . , n, we can safely assume that W is (Jd, /J>(n), a)- 
decaying. Also, since J(k) decreases doubly exponentially for k = 0, . . . , n we can assume (for 
k > 0) that 

Jd = 004(1)) = 0(J^ e ). (4.21) 

Let S be the solution of the linearized block-diagonalization problem Eq. (|4.4p constructed in 
Lemma 15.11 By construction S is (if, n(n), /3)-decaying, where 

= f^]- = 0(J(n)). (4.22) 
1 - c 2 Jd 

Here we assumed that J is sufficiently small, so that c 2 Jd < 1/2. Note that the assumptions of 
Lemma 15.11 require /3 > 2 and a > (3 + 4. 

Recall that and are raw versions of W (n + 1) and y(ra + 1) defined in Eq. (|4.5j) and 
Eq. (j4.6p . From Corollary 15.11 Section 15.31 we infer that the operator W has a local decompo- 
sition with block-diagonal terms which is (cJ(ra), /u(n))-decaying. Note that the assumptions of 
Corollary 15.11 require (3 > 2 and a > (3 + 4. The operator V can be rewritten as 

V = [S,V] +u(H), (4.23) 

where H = H + W+V and oj(H) = e s He~ s — H -[S, H]. We can assume that H is (c, /z(n), /?)- 
decaying where c = 0(1) and we assumed that (3 < a. Applying Lemma 14.11 from Section [4.21 
we get a local decomposition for [S, V] which is {c\ J(n) 2 , /x(n))-decaying for some constant c\ 
provided that f3 > 4. Applying Lemma fG.ll from Section[6]we get a local decomposition for oj(H) 
which is (02 J(n) 2 , /x(n) /2, — l)-decaying for some constant c 2 provided that /3 > 6. Summarizing, 
y is (cJ(n) 2 , fj,(n)/2, — l)-decaying where c is a constant. It will be convenient to keep the decay 
rates of W and V the same. Thus we shall assume that W is (cJ(n), /u(n)/2)-decaying (which 
is a weaker version of what we proved above). One can easily check that a choice 

a = 10 and = 6 (4.24) 

satisfies conditions of all lemmas used above. 

Recall that W{n + 1) and V(n + 1) are defined by taking the local decompositions of W and 
V and removing all terms associated with squares of size larger than L*. Therefore W(n + 1) 
and V(n + 1) have the same decay parameters as W and V, that is, we get 

J(n + 1) < ciJ(n) 2 , J d (n + 1) < c 2 J(n), /i(n + 1) = -fi(n) (4.25) 

for some constants ci, c 2 . Note that we have not reset a, (3 to their original values yet. The total 
number of squares of size r > L* is at most L 3 . Thus the contribution to the error Hamiltonian 
E(n + 1) can be estimated as 

\\E(n + 1)|| < \\E(n)\\ + 0(L 3 ) J(n)e^ (n)L * /2 = \\E(n)\\ + 0(L 3 ) J(n) e - C ^ (n)i (4.26) 

for some constant C3. Resetting the constants a, (3 using Lemma 14.21 we arrive at the desired 
flow equations Eqs. (|4.17«4.2Up . 
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Solving Eq. ()4.17|) we get 



J(n)<j(£\ , 9 = 2(l-e) (4.27) 

for some constant Jo > 0. Note that we are free to choose the constant e as small as possible. 
To simplify the formulas let us set 9 = 2. Also note that for small enough Jo we can assume 
that n(n) decays exponentially with exponent arbitrarily close to 1/2. To simplify the formulas 
let us assume that 

yj and /i(n) = /i2~". (4.28) 

Simple algebra shows that choosing the number of steps n = c log L for some constant c one can 
achieve the bounds 

\\V(n)l\\E(n)\\<poly(L)ex V (-cVL). (4.29) 

Here the exponent —c\/~L is determined by a tradeoff between the doubly exponential decay 
of J(n) and the exponential decay of //(n). Neglecting these exponentially small errors we can 
assume for simplicity that the Hamiltonian H(n) contains only block-diagonal contributions, 
that is, 

n 

H(n) = H + W, W = W(k). (4.30) 

k=0 

Here the local decomposition of W is ( Jd, 0, a)-decaying with Jd = 0(J 1_<E ), a = 10, and all 
terms in this decomposition are individually block-diagonal. In addition, by definition of W(k), 
this decomposition contains only squares of size r < L*. By performing an overall energy shift 
and using TQO-1 we can guarantee that every local term in the decomposition of W(k) has 
zero restriction on the P subspace (note that it increases the strength Jd at most by a factor of 
two). It allows us to apply the machinery of Section [3l In particular, Lemma I3T21 savs that W 
is relatively bounded by Hq with a constant 

b<0(J d )Y,r 2 - a = 0{J d ). (4.31) 

r>l 

Assuming that b < 1, Lemma [3.11 implies that the spectrum of Hq + W is contained in the union 
of intervals 1^ = (k(l — b), k(l + b)), where k = 0, 1, 2, . . .. The effect of V{n) and E{n) can 
now be taken into account using the standard perturbation theory by considering Hq + W as 
an unperturbed Hamiltonian and using Eq. (j4.29[) . 

Finally, notice that since the dominant contribution to W comes from the first level k = 1, 
we do not really need to perform the degree reset to estimate b (since the degree reset only 
changes our description of an operator but does not change the operator itself). Hence we can 
set Jd = O( J). The theorem is proved. □ 
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5 Linearized block-diagonalization problem 

5.1 Statement of the problem 

Consider a pair of perturbations 

l<r<L* AeS(r) 

W = E E ^ ( 5 - 2 ) 

l<r<L* yle<S(r) 

such that all terms W r A are block-diagonal, 

QW r , A P = for all r, A. (5.3) 

A linearized block-diagonalization problem can be divided into two parts. The first part is 
to find an anti-hermitian operator S such that 

Q([S,H + W} + V)P = 0, S^ = -S (5.4) 

and construct a local decomposition of S. The second part is to construct a local decomposition 
for the transformed Hamiltonian 

W= [S,H + W] + V = J2 W riA (5.5) 

r>l Ae<S(r) 

such that every term in the local decomposition is block-diagonal, that is, QW r A.P = for all 
r, A. We solve the two parts of the problem in Lemma 15.11 and its Corollary 15.11 respectively. 
Throughout this section we assume that Hq is a Hamiltonian defined in Eq. ()2.ip that obeys 
conditions TQO-1 and TQO-2. 



5.2 Finding the transformation S 

In this section we prove the following 

Lemma 5.1. Let V and W be perturbations defined in Eqs. \5.H5.M15.3\) . Suppose that V is 
(J, /x, f3)-decaying and W is (Jd, /i, a)-decaying such that (3 > 2 and a > (3 + 4. Then there exist 
constants c±,C2 > such that Eq. fl<5.^| ) has a solution S which is (K, fi, (3) -decaying with 

K = -^ J - T , (5.6) 

1 - C 2 Jd 

Proof. Let us start from several simplifying assumptions. Without loss of generality we can 
assume that 

W rA P = for all r, A. (5.7) 
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Indeed, condition TQO-1 guarantees that W r ^P is a multiple of P. Shifting W r A by the 
corresponding multiple of the identity we can satisfy Eq. (|5.7[) . On the other hand, one can 
easily check that Eq. (15. 4|) in invariant under such a shift. Note also that the shift can increase 
the norm || at most by a factor of two. Indeed, if W T) aP = cP then |c| < || W r a|| an d thus 

— cl\\ < [|W r ,A|| +|c| < 2||W r) ^||. Thus we can assume that W satisfies Eq. (15.70 provided 
that we change Jd to 2 Jd in the final answer. By the same reasons, we can assume that 

PV r A P = Q for all r, A (5.8) 

provided that we change J to 2J in the final answer. In addition, we can assume that V r a 
commutes with Gb, B G 5(2), whenever B is not contained in A, 

[V r ,A, G B ] = for all B G 5(2) such that B n A c / 0. (5.9) 

Indeed, by adding an idle layer of sites to each square in the decomposition of V as explained 
in Lemma 14.31 we guarantee Eq. (|5.9|) . The price we pay for this simplification is that J has to 
be changed to cJe 2/i = 0{J) in the final answer, see Lemma 14.31 Note also that now the local 
decomposition of V in Eq. (|5.1j) starts from squares of size r = 3, 

3<r<L* AeS(r) 

We shall construct a solution S as a series S = Ya^=i '> where is anti-hermitian for 
all i and 

Q([SW,H ] + V)P = 0, (5.11) 
Q{{S( i+1 \H ] + [S®,W])P = 0, fori>l. (5.12) 

For any region A C A define a restricted Hamiltonian 

H (A)= °b- (5-13) 

BeS(2) 
BCA 

Define also a super-operator Ea that takes as argument an arbitrary operator O and returns an 
operator 

£a(0) = QaH q (A)- 1 OP a - PaOH q (A)- 1 Q a . (5.14) 

Note that the Pa is the zero-subspace of Hq(A), so that QaHq(A)~ 1 is well-defined. 

Proposition 5.1. Let Oa be any operator acting on A such that POaP = 0. Suppose Oa 
commutes with Gb, B G 5(2), whenever B is not contained in A. Then 

Q([£a(Oa),Ho} + O a )P = 0. (5.15) 
If Oa is hermitian then £a(Oa) is anti-hermitian. 
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Proof. Indeed, since all terms in Hq which are not contained in A commute with £a(Oa) while 
H Q {A)P A = we have 

[£a(O a ),H ] = -QaOaPa - PaOaQa- 

It yields 

Q[£ a (Oa),H }P = -QQaOaP = -Q(Qa + Pa)O a P = -QO A P, 

where the first equality follows from PaP = P, QaP = 0, and the second equality uses identity 
PaOaP = POaP = 0. Thus we have proved Eq. (|5.15p . The last statement of the proposition 
is obvious. □ 

Using the assumptions Eqs. (|5.8l5.9p and the proposition we can choose S^ 1 ' in Eq. (|5.1ip 

as 

S W = E E S$. S^ = £ A (V r , A ). (5.16) 

r>3AeS{r) 

Taking into account that 

\\£a(O a )\\ < \\0 A \\ for any A (5.17) 
we conclude that Eq. (j5.16p is a local decomposition of which is (Ki, fj,, /3)-decaying where 

K x = J. (5.18) 

This decomposition has an extra property that S^ A is block-off-diagonal with respect to Pa, 
Qa, and commutes with Gb, B G 5(2), whenever B is not contained in A. 

Let us now solve Eq. (j5.12p . We shall assume as our induction hypothesis that possesses 
a local decomposition 

Sil) = E E 4!a (5-19) 

P>3 A£S(p) 

such that 

(i) 

11 Sp A is block-off-diagonal with respect to Pa, Qa 

12 Sp A commutes with Gb, B E 5(2), whenever B is not contained in A 

13 \\[H (A),S^}\\<K ip -Pe-w 

Here the coefficient Ki will be determined inductively in terms of K%—\. Note that combining 
(II), (13) with the fact that Q A H (A) > I one gets 

= \\QaS { p %Pa\\ < \\QaH (A)S { p %P a \\ = || [H (A),S^] || < K ip -^, 

that is, SW is (Ki, /x, /3)-decaying. 

The base of induction is % = 1 which we have already proved. Our first step will be choosing 
a local decomposition for the commutator [S^\ W] in Eq. (j 5 . 1 2 [) . We shall need the following 
geometrical fact. 
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Proposition 5.2. Let A £ S(p), p > 2, and B £ 5(g), q < L, be any squares such that 
A n B 7^ 0. T/ten i/iere exists a square C £ S(r), r = min (p + g, L), suc/i i/iai ^4 U B C C and 
C contains all nearest neighbors of B. 

Proof. If r = L the statement is obvious, so assume r = p + q < L. Define a metric on the 
lattice using the Zoo-norm, that is, if u = (u x , u y ) and v = (v x , v y ) is a pair of sites, then 

D(u,v) = m&x{\u x - v x \, \u y - v y \}. 

For any region M C A let D(M) be the diameter of M, i.e., the largest distance between 
a pair of sites u,v £ M. Clearly D{A) = p — 1 and D{B) = q — 1. Since A n B ^ we 
have D(A D B) < D{A) + D(B) = p + q — 2. Therefore, j4 U -B can be covered by a square 
C G + q — 1). If C = B one actually has C G 5(g). Now one can choose C as an arbitrary 
square of size r that contains C and all nearest neighbors of C . Suppose now that C ^ B. 
Then either C contains all nearest neighbors of B, or C shares an edge or a corner with B. In 
the latter case, we can extend the size of C by one obtaining a square C £ S(p + q) with the 
desired properties. □ 

The proposition implies that 

r>3 CeS(r) 

where 

^!c= E E E [S^,W q , B ]x(A,B,C) (5.21) 
P+g=r Ae5(p) BeS(?) 

ACC BCC 

and x(A, -B, C) =0,1 is some function that 'distributes' the commutators over different terms 
D t C . A particular choice of such distribution is not important for us. Using Proposition 15.21 we 
can assume that B,C) = unless A U B C C and C contains all nearest neighbors of B. 
By construction, Dy c has support on C, that is, Eqs. (|5.20I5.21|) define a local decomposition 

of the commutator [S^\ W]. 

Using Proposition 15 . 1 1 we can choose a solution S"^ +1 ) of Eq. (|5.12p as 

S (m) = E E S^=£ C (D%). (5.22) 

r>3 CeS(r) 

This is a local decomposition of S 1 ^ 1 ) that satisfies (II) by definition of the map £c- Let us 
check that it satisfies (12). Indeed, let Gp, F £ 5(2), be such that F is not contained in C. 
Then F is not contained in A C C and thus G^r commutes with all \ in Eq. (I5.2ip . Since 
for all terms W Qi b m Eq. ()5.2ip the square C contains both B and the nearest neighbors of B, 
we conclude that F does not overlap with B, that is, Gp commutes with W q: B- Finally, Gf 
commutes with all Krauss operators involved in the map £c- Thus Gf commutes with S^ 1 ^ 
which proves (12). 
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It remains to verify that the local decomposition Eq. (|5.22p satisfies (13). It will be convenient 
to introduce an auxiliary Hamiltonian 

W q (A,C)= x(AB,C)W q , B . (5.23) 

BeS(q) 
BCC 

It allows us to rewrite Eq. ()5.21[) as 

D %= E E [S^,W q (A,C)}. (5.24) 

p+q=r A£S(p) 
ACC 

Applying Corollary 13.21 using Eq. (|5.7|) . and taking into account that W is ( Jd, fJ>, a)-decaying 
we get 

WmC^S^W = \\QcD%P c \\< £ \\Qc[S^W g (A,C)]Pc\\ 

p+q=r AGS(p) 
ACC 

< E E b q \\Q c [H (C),S^]Pc\\, (5.25) 

p+q=r AeS(p) 
ACC 

where 

b q < cJ d q 2 - a e-w (5.26) 

for some constant c. From (II) we infer that S^ A is block-off-diagonal with respect to Pa, Qa 

while (12) implies [Hq{C),S^ a \ = [Hq(A), Sp A ]. Taking into account that Pc = PaPc we get 
a bound 

\\Qc[H (C),S^]P c \\ < \\Q A [Ho(A),S^]P A \\ = \\ [H (A),S^ A ] || < J^p-V* (5.27) 

where the last inequality follows from (13). Combining Eqs. (|5.25|I5.27|) and noting that the 
number of squares A € <S(p) such that A C C is equal to (r — p) 2 = q 2 we get 

r-l 

llftfotc),^]!! < 

9=1 

r-l 

< cJ^e-^^g 4 - Q (r-g)^. (5.28) 

9=1 

It is convenient to split the sum over q into two parts: 

Y Q 4 ~ a (r - qY P < cr-l 3 ^ q 4 ~ a = t/r~ p (5.29) 

l<q<r/2 q>l 

for some constants c, c' since we assumed a > f3 + 4 > 6. As for the other part, we have 

Y cf~ a { r -Q)' 13 < cr 4 ~ a Y <T P = c ' r4 ~ a < c ' r ~ ( 5 - 30 ) 

r/2<q<r-l q>l 
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for some constants c, c' since we assumed that /3 > 2 and a > /3 + 4. Therefore we arrive at 

|| [^(C),^] || < cJ d K ir -P e -» r (5.31) 

for some constant c which proves (13) for 

K i+1 = cJ d Ki. (5.32) 

Thus all induction assumptions are proved for S^ +1 \ By obvious reasons S = ^j>i &i ^ s 
(K, n, /3)-decaying with 

K<'£K i = -^—. (5.33) 

□ 

5.3 Local decomposition of the transformed Hamiltonian 

Lemma 15. II has the following corollary. 

Corollary 5.1. Let V, W, and S be as in Lemma l5.il Suppose V is (J, /i, (3) -decaying and 
W is (Jd, /i, a) -decaying for some /3 > 2 and a > (3 + 4. XTien a transformed Hamiltonian 
W = [S, Hq + W] + V has a local decomposition 

w = y^ E ^ a ( 5 - 34 ) 

r>l AeS(r) 

such that QW rt AP = for all r,A. This decomposition is (J d, jj)- decaying, where 



Jd < rr^-r (5.35) 
1 - cJ d 



for some constant c. 



Proof. We shall use notations and techniques introduced in the proof of Lemma 15.11 By defini- 
tion of S we have 

W = J2W^, (5.36) 



i>l 

where 

= [SW , H ] + V, and W« = [S® , H ] + [S^ ,W] for i > 2. (5.37) 
Let us choose the local decomposition of as 

= E E <i> ( 5 - 38 ) 

r>3 AeS(r) 

where 

wfi = [f a(V p>a ), Ho] + ^r,A = PAVr,A p A + Qa^aQa- (5.39) 
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Here the last equality uses Proposition l5.1l (see the first equation in the proof of the proposition). 
Obviously, ||W r ^|| < and thus is ( J, fi, /3)-decaying. As was noticed in the proof of 

Lemma 15. II we can assume that V t% a commutes with all operators Gb, B 6 5(2) for which B is 
not contained in A. It means that 

PW rA = P Vr,A P A = PVr,A P = W$P, (5.40) 

that is, W^~\ is block-diagonal. 

Recall that we have a decomposition 

r>3 CeS(r) 

where D^q commutes with all operators Gb, B € 5(2) for which B is not contained in C, see 
Eqs. (j5.2QI5.2ip in the proof of Lemma [5.11 It means that we can choose the local decomposition 
of WW with % > 2 as 

W ^ = e E ( 5 - 42 ) 

r>3 CeS(r) 

where 

W$ = [Sc{D<fc\ H ] + L^f = PcD^Pc + QcD% l) Qc, (5.43) 

see Eq. (|5.22p . Obviously, ||W r ^|| < \\D^c II an d is block-diagonal. Using the fact that 
S'W is (i^j, |U, /3)-decaying where if, is defined by Eqs. (15. 18II5.32() . and using Lemma [4.11 we 
conclude that is (cJ^-Kj-i, /i)-decaying. Using Eq. (|5.36p we obtain a local decomposition 
of W with individually block-diagonal local terms which is ( J^, /u)-decaying with 

J d = J + E cJ ^-i ^ J + E J ( cJ ^) i = 7~Z~~T' ( 5 - 44 ) 

i>2 i>l 

Finally we have to replace J and by O(J) and O(Jd) to justify our assumptions Eqs. (j5.7l5.8l5.9p . 

□ 



6 Lieb-Robinson bounds 

Let S be some anti-hermitian operator and V be an arbitrary operator. We shall use notations 

T (V) = e s Ve~ s , 

u(V) = t(V)-V-[S,V}. (6.1) 

Suppose we are given some local decompositions of S and V. In order to get a closed system of 
flow equations we need to construct a local decomposition for tv(V), see Section 01 The main 
result of this section is the following lemma. 



29 



Lemma 6.1. Suppose S is (K, fi, a) -decaying for some a > 6. Suppose V is {J, [i, (3) -decaying 
for some (3 > 6. Then uj(V) has a local decomposition which is (cJK 2 , /i/2, — 1)- decaying for 
some constant c. 

Thus the magnitude of interactions of range r in tv(V) decays as cJ^re - ^ 7 '^ 2 . Our ar- 
guments will rely on the Lieb- Robinson bound, see [20J. More specifically, we shall exploit 
quasi-locality of dynamics in quantum spin systems with a fast decay of interactions as pre- 
sented in j!9j. The extra factor 1/2 in the decay rate of oj(V) represents a simple geometrical 
fact that the diameter of the light-cone of any local region increases with a rate 2v , where v 
is the Lieb- Robinson velocity. This extra factor 1/2 is the price one has to pay for using the 
powerful machinery built on the Lieb-Robinson bound. 

We shall start from solving a somewhat simpler problem. Let O be any operator with 
support on some square B G S{q). Consider a local decomposition of S, 

P>2 AeS(p) 

Let C G S{q + 2j) be a square that contains B and all sites within distance j from B (with 
respect to the /^-distance), that is, 

C = {ue A : D(u,B) <j}. (6.3) 

Let Sc be a localized version of S obtained by taking out all interactions whose support is not 
contained in C, 

p>2 A€S(p) 
ACC 

Define also a localized version of oj(0), that is, 

u c (p) = e Sc Oe~ Sc -O- [S c , O}. (6.5) 

By definition, u>c(0) has support on C. We shall need a bound on the difference || ui(0) —ujc{0) \\ 
that is proportional to ||0|| • K 2 e~^ . 

Lemma 6.2 (Quasi-Local Dynamics). Let S, Sc and O be the operators defined above. 
Suppose S is (K, n, a)-decaying for some a > 6. Then there exist constants cq, c\ > such that 

\\u C (0) - u>(Q)\\ <c (q + j)q 4 K 2 \\O\\e-M (6.6) 

whenever K < c\. 
Proof. Define 

T \0) = e st Oe~ st , T c (0) = e Sct Oe- Sct . (6.7) 
For any < t < 1 define an operator 

f(t) = r c (0) - r\0) - [S c - S, 0]t. (6.8) 
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Note that f(t) is an analytic function and 

/(0) = /(0) = 0, f(l)=cu c (0)-u(0). (6.9) 

Computing the derivatives over t we get 

/(*) = [SctUO)} - [S,t\0)] - [S c -S,0], (6.10) 

f{t) = [S C , [ScrhiO)}] - [S, [S,t\0)]]. (6.11) 
Let us extract from fit) a norm preserving term [S, fit)]. After simple algebra we get 

fit) = [S, fit)] -[S- S C , [Sc, rhiO)]] - [S, [S - S c , O}]. (6.12) 

It means that 

fit) = t* (J* t~ s i[[S C) T s c iO)\, ASc] + [[AS C , O], S}) ds^j , (6.13) 
where ASc = S — Sc- Taking into account the initial conditions Eq. (I6.9P we get 

||/(1)|| <J dtxWfihW <±\\[[AS C ,0], S]\\+£dtx j\h ||[r*?([5c,0]),A5 c ]||. (6.14) 

Let us start from bounding the time-independent term [| [[A5c, O], S] ||. Denote 

F p2iPl = #{iE 2 , E X ) : E 2 G Sip 2 ), Ex G Sfa), E 2 n^UB)/!, ExHB^ 0}. 

One can easily check that 

r p2 , Pl < iq + px + P2?iq + Pi? < 2iq + pxf + 2(? + pxfpi 

The commutator [ASc, O] has contributions only from squares of size > j in the decomposition 
of S, which implies 

||[[A5 c ,o],5]||< c ^ 2 ||o||^ £ r^^pr^v^ 1 ^- 

Pl>j P2>1 

Here and below c stands for a constant factor. Since a > 6 the sum over p 2 is bounded by a 
constant and we arrive at 

\\[[AS c ,0],S]\\ < cK 2 \\0\\e-^ £(g + pi)V a = cg 4 iT 2 ||0||e-«. (6.15) 

pi>j 

Let us now bound ([5cs O]), ASc]\\- Note that the commutator has contributions only 
from those terms in the decomposition of ASc that overlap with both C and its complement 
C c . Define 

n pi,P2(*) = ™ax max max || [t£([Oi, O]), 2 ] \\, 
E t eS(pi) E 2 €S(p 2 ) Oi,0 2 

E 2 nC c ^0 
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where 0\ , O2 are unit-norm operators acting on E\ , E2 respectively. Counting the number of 
squares contributing to the double commutator yields 

|| [rh([S c , O]), AS C ] || <c(q + j)K 2 (l+Pl) 2 Pl a pl' a ^ 1+P2) ^, P M 

P1.P2>1 

As we show below, the unitary evolution under Sc can be characterized by a finite Lieb- Robinson 
velocity vlr = O(K). Using the Lieb- Robinson bound from [19] that governs unitary evolution 
under Hamiltonians with exponentially decaying interactions one gets 

tt pi , P2 (t) < c\\0\\(pi + g) 2 p|exp [v LR t - fiOij-pt - p 2 )], 

where 0{x) = x for x > and 9(x) = for x < 0. Note that 6(j — p\ — P2) is a lower 
bound on the distance between supports of [Oi,0] and O2 in the definition of Q, PltP2 . Clearly, 
Pi +P2 + — Pi — P2) > 3- Since < t < 1 we can assume that VLRt = O(Kt) = O(l). Hence 

\\[Th({Sc,0]),AS c ]\\<c(q + j)\\0\\K 2 e-M £ (q + Pl ) A p^ a pt a < c(q + j)q 4 \\0\\K 2 e-^ 

P1,P2>1 

( 6 - 16 ) 

for q > 6. Combining Eqs. ()6.15I6.16|) and computing the integrals in Eq. (|6.14|) we arrive at 



|/(1)| <c(q+j)q 4 \\0\\K 2 e 



i-'j 



It remains to check that we have fast enough decay of interactions in S to use the Lieb- 
Robinson bound from Ref. |19j . Below, we use notations from Ref. [19]. Define a function 



W _ 2Ei^E). (6 . 17) 



As was shown in Ref. [19] the Lieb-Robinson velocity is bounded by a multiple of: 

" ,ofcA r>UeS(r) MV v /y 
ABu,v 

with the multiplying factor being 2Cb(l), where Co(l) is a numerical constant (depending only 
on the dimensionality of the system; in our case the dimension is 2.) For any pair of sites u, v S A 
with A £ S(r) one has D(u,v) < r (here and below we use the Zoo-distance). Since F fl (x) is 
monotone-decreasing we have F fl (D(u,v)) > F^{r). Taking into account that the number of 
squares A £ S(r) such that A 3 u is at most r 2 we get 

\\SWp < K^V(l + r 2 )r~ a < cK, c = ^(r~ 2 + r" 4 ) < 4, (6.19) 

r>l r>l 

provided that a > 6. Thus, the Lieb-Robinson velocity is bounded by 8Cq(1)K. □ 
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Let us use Lemma [6.2l to construct a local decomposition for uj{0). This local decomposition 
will involve a sequence of squares 

fi = BcBiCB 2 C..cA (6.20) 

such that Bj £ S(q + 2j) is the square obtained from B by adding a boundary region of thickness 
j on all sides of B, that is, 

Bj = {u € A : L>(u, B) < j}. (6.21) 
Note that Bj = A for large enough j. Then we can write co(0) as 

u(0) = D B0 (O) + Y,D B] (O), D B0 (O)=uj b (O), D B .(p)=uj B .(0)-uj Bj _ l (0). (6.22) 

Note that D Bj (0) acts only on Bj, so Eq. f|6.22|) defines a local decomposition of w(O). Using 
Lemma 16.21 we infer that 



\\D Bj (0)\\ < \\0J Bj (O) - oj(0)\\ + Wm.^O) - u(0)\\ <c(q + j)q 4 K 2 -\\0\\e~M (6.23) 
for some constant c. In addition we have a standard bound (see for instance Ref. [7]) 

||Db (O)|| = \\u> B (0)\\ < ^\\[S B ,[S B ,0]}\\<2\\S B \\ 2 \\0\\. (6.24) 
Taking into account that 

Q Q 

W S b\\<Y1 E ll^ll <^2(q-p) 2 Kp- a e-^<cq 2 K (6.25) 

P=2 AeS(p) P=2 
ACB 

for some constant c. Thus for all j > we have 

\\D Bj (0)\\ <cq\q+j)K 2 \\0\\e-M. (6.26) 
Having finished this warmup we can easily prove Lemma 16.11 
Proof of Lemma 16.11 Consider a local decomposition of V, 

V = J2 E KbII < ^e-"*. (6.27) 

<?>2 BeS(g) 

We construct a local decomposition of (^(V^s) as in Eq. ()6.22|) . where O = V q>B . We get 

0J(V)=J2 J2 n r,A> ( 6 - 28 ) 
r>2 AeS(r) 

where 

r/2-l 

E E ^(^- 2Ji b)- (6.29) 

j=0 BeS(r-2j) 
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Note that for fixed j the sum over B contains a single square B such that A = Bj, see Eq. (|6.2ip . 
Using Eq. (|6.26p with q = r — 2j, we arrive at: 

r/2-l 

H^MI < E E K 2 \\V r ^ hB \\{r-2 3 ) A re-M 

j=0 BeS(r-2j) 
A=B J 

r/2-1 

< K 2 J(r - 2jf-Pre~^ r - j) < cK 2 rJ(T^. 

3=0 

for some constant c provided that f3 > 6. We have shown that w(V) is (cK 2 J, /i/2, — l)-decaying. 

□ 



7 Adiabatic continuation of logical operators 
7.1 Dressed Operators 

Since the gap remains open up to a certain strength of perturbation, this means that the 
perturbed Hamiltonian Hq + V is adiabatically connected to the original Hamiltonian, Hq. This 
adiabatic connection suggests that the perturbed Hamiltonian should have similar properties to 
the unperturbed Hamiltonian. For example, following |3], we can define string operators which 
have nontrivial action in the ground state subspace of the perturbed Hamiltonian and which 
have the correct commutation relations and expectation values. In fact, Theorem [1] will allow 
us to do even more, to define local operators that create defect excitations with well-defined 
energies. 

To construct these operators, we use quasi-adiabatic continuation. We define a continuous 
family of Hamiltonians, 

H S = H + sV, (7.1) 

so that as s varies from to 1, H s continuously interpolates between Hq and the perturbed 
Hamiltonian. 

We define a quasi-adiabatic continuation operator, T> s by 

V s = i j dtF(t)exp(iH s t)(d s H s ^exp(-iH s t), (7.2) 

where the function F(t) is defined to have the following properties. First, the Fourier transform 
of F(t), which we denote F(uj), obeys 

\u\ > 1/2 -> F(w) = -l/w. (7.3) 

Second, F(co) is infinitely differentiable, so that F(t) decays faster than any negative power of 
time for large \t\. Third, F(t) = —F(—t), so that T> s is anti-Hermitian. 
We define a unitary operator U s by 

U a = S' ex p{^ ds'V s }, (7.4) 
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where the notation S' denotes that the above equation ([7.4J1 is an s'-ordered exponential. 

The motivation for defining the above unitary operator is contained in the following lemmas: 

Lemma 7.1. Let H s be a differentiable family of Hamiltonians. Let \^ l (s)) denote eigenstates 
of H s with energies Ei. Let E m i n (s) < E max (s) be continuous functions of s and 

L(s) = {AeR: E min (s) < A < E max (s)}. 

Define a projector P(s) onto an eigenspace of H s by 

P(s)= £ \¥(s))(¥(s)\. (7.5) 

Assume that the space that P(s) projects onto is separated from the rest of the spectrum by a 
gap of at least 1/2 for all s with < s < 1. That is, any eigenvalue of H s either belongs to L(s) 
or is separated from L(s) by a gap at least 1/2. Then, for all s with < s < 1, we have 

P(s) = U s P(0)Ul (7.6) 

Proof. By linear perturbation theory, 

d s P(s) = E E ^^\^Hs))(^ j (s)\d s H(sW(s)))(¥(s)\ + h.c (7.7) 
i€i(<) jM) 1 3 

= E E l* j ( s )>(V( s )l [ dtF(t)eMiHst)d s H(s)exp(-iH s t)\^(s))yW i (s)\+h.c 
= i[V s ,P s ]. 

The first equality in the above equation holds because 

(¥(s)\ J dtF(t) exp(iH s t)d s H(s) exp(-iH s t)\¥(s)) (7.8) 

= | JdtF(t)exp[i(Ej - Ei)t]d s H(s)\¥(s)) 

= FiEj (s)\d.H(a)\* i (s)), 

where we use Eq. (|7.3|) to show F(Ej — Ei) = —l/(Ej — Ei) using the assumption on the gap 
in the spectrum that \Ej — Ei\ > 1/2. 

Since d s {U s P(0)u!) = i[V s , U s P(0)Ul], and U Q = I, Eq. (JEBJ follows from Eq. §71$. □ 

This purpose for introducing this quasi-adiabatic continuation operator is to define certain 
"dressed" operators, following the idea introduced in [3]. Let Oi,02, ••• be some operators that 
create defects when acting on the ground state of Hq. These operators may be defined to 
have certain commutation or anti-commutation requirements. For example, if we have certain 
operators Of which create electric defects on a given neighboring pair of sites, and operators 
Of 1 which create magnetic defects on a given neighboring pair of plaquettes in a toric code 
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state, then these operators all commute with each other, except an electric and a magnetic 
operator anti-commute if the bond connecting the sites and the bond connecting the plaquettes 
on the dual lattice intersect. Then we define "dressed" operators Oi(s) by 

Oi(s) = U s OiUl (7.9) 

Since U s is unitary, these operators Oi(s) obeys the same commutation or anti-commutation 
requirements: 

[O h Oj] = -> [Oi(s), Ojis)] = 0, (7.10) 
{O i ,O j } = 0->{O i (s),O j (s)} = 0. 

Let P n (s) project onto the eigenspace of H(s) with energy in the interval I n , see Theorem[TJ 
We assume that J is chosen sufficiently small (see Section [1. lj) so that I n is separated from the 
rest of the spectrum by a gap at least 1/2. So, Lemma 17711 implies that if a product of operators 
0\Oi- -O m acting any ground state of Hq creates an eigenstate of Ho with energy n, then the 
product of operators Oi(s)02(s)...O m (s) acting on any ground state state of H s creates a state 
with energy in the interval I n . To see this, note that a ground state ^o( s ) of H s can be written 
as U s ^o(0) for some ground state ^o(O) of Hq. Then, 

P n {s)O l (s)...O m (s)\^ {s)) = U s P n (0)O 1 (0)...O m (0)\y ) (7.11) 
= Z7,(0)Oi(0)...O m (0)|* > = O 1 (s)...O m (s)\^ ( S )). 

Further, if some product of operators 0\0<i— has a given expectation value in the ground 
state of Hq, then the corresponding dressed operators have the same expectation value in the 
ground state of H s . For example, since a product of a x around a contractable loop has expec- 
tation value unity in the ground state of the toric code, the same product of dressed operators 
will have the same expectation value in the ground state of H s . 

The next important property we want to show is that the operators Oj(s) are local. To 
do these, we need a Lieb-Robinson bound for quasi-adiabatic continuation. Before describing 
this, some comments. We have chosen to define the quasi-adiabatic continuation using "exact" 
expressions. That is, we have chosen a filter function F(t) such that its Fourier transform is 
exactly equal to 1/u outside a certain interval. This is Osborne's[2T] modification of the quasi- 
adiabatic continuation of [3J. As a result, our filter function F(t) decays faster than any negative 
power of t, but does not decay exponentially in t (however, there do exist such filter functions 
with |-F(i)| decaying exponentially in a polynomial of t [23]). This contrasts with the approach 
in [3J, where an approximation was used that gave a filter function decaying exponentially in t 2 . 
Each approach has certain advantages and disadvantages. The approach in [3J has the advantage 
that one can often get exponentially good approximations, rather than approximations which 
merely decay faster than any power. However, Osborne's modification which we use here has 
a few advantages also. First, we get an exact result that ^o(s) = U s ^o(0) above, while the 
approach in |3j only gives approximate estimates. Second, it is much easier to derive locality 
estimates. In particular, the Lieb-Robinson bound for quasi-adiabatic continuation in the next 
lemma, which is the key step to prove locality of the dressed operators, is much easier than 
[31 [52] . The reason that this bound is so much simpler is the following: the Fourier transform 
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of the filter function here can be chosen to be some given bounded function of w. In [3j 122j . 
we have a parameter-dependent family of filter functions. As this parameter is increased, the 
accuracy of the approximation improves, but also the upper bound on the Fourier transform of 
the filter function gets worse, and hence the bound on the norm of T> worsens. 

The locality of the dressed operators depends on locality properties of Hq, V. We require 
that Hq,V are both sums of of local operators Ha t z,Vz, where both Ho t z,Vz obey a bound 
that, for all sites u £ A, 



\\H 0) z\\\Z\ exp(/xdiam(Z)) = O(l), (7.12) 
^2 \\V Z \\\Z\ exp(/xdiam(Z)) < J < oo, 



where \Z\ denotes the cardinality of Z, for some positive constants /i, J. 

Given such locality property, for any finite dimensional system we have a Lieb-Robinson 
bound for Hq + sV that, for any operator Oa supported on set A and any Ob supported on set 
B, 

\\[exp(iH s t)0 A exv(-iH s t),0 B }\\ < exp[- fi (6M(A,B)-v LR t)]\A\\\OA\\\\0 B \\, (7.13) 
where vlr is some constant which depends on fj,, J. This bound is shown in |20j. 



Lemma 7.2. Let Hq and V obey Eq. \7.12 ). Then, if Oa is supported on set A and Ob is 
supported on set B, 

\\[U s O A UlO B ]\\ <h(di S t(A,B))\A\\\0 A \\\\0 B \\, (7.14) 

where \A\ denotes the cardinality of A, for < s < 1, for some function h(l) which decays faster 
than any negative power of I. Similarly, 

\\[uIO a U s ,Ob}\\ < /i(dist(A,S))|A|||0 A ||||0 B ||, (7.15) 

Further, the operator D s is a sum of operators T> S (Z), with 

\\V S (Z)\\ < const, x IIV^H, (7.16) 

and with the property that each such operator T> S (Z) obeys, for any operator Ob, 

\\[V s (Z),0 B ]\\ < fc'(dist(A,B))|^|||yz||||OB||, (7.17) 
for some function h'(l) which decays faster than any negative power of I. 
Proof. We have 



where 



V S = Y J V S (Z), (7.18) 

z 

V s (Z) = i J dtF(t)exp(iH s t)V z exp(-iH s t). (7.19) 
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Eq. (|7.16p follows immediately from a triangle inequality because J dt\F(t)\ converges. 
Let Ob be an operator supported on set B. Then 

\\[V s {Z),O b ]\\ < J dt\F(t)\\\[eMiH s t)V z exp(-iH s t),0 B }\\. (7.20) 

For t < dist(Z, B)/2vlr, the above expression is exponentially small in dist(Z,B) by the Lieb- 
Robinson bound, while for larger t, the expression is bounded by and hence this com- 

mutator decays faster thany any negative power of dist(Z, B). 

This decomposition and locality bound (I7.20p implies that there is a Lieb-Robinson bound 
for evolution using T> s as a Hamiltonian. Using the Lieb-Robinson bound in [20] (the use of 
this bound does require some geometric properties of the lattice, which hold for any finite 
dimensional lattice), we have that 

\\[U s O A UlO B }\\ < exp(cs)h'(dist(A,B))\A\\\0 A \\\\0 B l (7.21) 

for some constant c which depends on F, J, fi and on the geometric properties of the lattice, 
and for some function h'{l) which decays faster than any negative power of I. Since we assume 
s < 1, Eq. (l7TT3jl follows from Eq. (17311) . □ 

Eq. (l7TT3j) implies that O a (s) = U s O A u\ can be approximated by an operator 0\ localized 
on the set of sites within distance I of set A. To see this, define Oj = / dUUO A {s)U\ where 
the integral ranges over unitary rotations, with Haar measure, supported on sites with distance 
greater than I from set A. Then, following [2], the desired result follows. 

The string operators can be used to manipulate the ground states. We will assume that the 
operators 0\ which create defects are, in fact, unitaries. For example, if a certain product of 
electric operators, 0\02--- creates a nontrivial loop around the torus and has nontrivial action 
on the ground state of Hq, then the product of dressed operators Oi(s)02 (*)••• has the same 
action on the ground state of H s . However, one might wonder how to create these dressed 
operators; it would be preferable not to have to solve quasi-adiabatic evolution equations to 
calculate what the operators should be, and then to produce them by careful control of a time- 
dependent Hamiltonian. Fortunately, we can use the gaps in the excited state spectrum to 
argue that it is possible to drag defects in the perturbed Hamiltonian H s in an identical way to 
what is done in Hq. First, we apply some local operator in an attempt to create a defect pair 
on neighboring sites. If this operator is not exactly equal to the desired dressed operator, then 
this attempt may fail, in the sense that the energy may not fall into the desired range I2, see 
Theorem [TJ We can detect this failure by measuring the energy, and cooling back to the ground 
state: that is, if there are extra defects created, we will drag them together, using the procedure 
described in the next paragraph, to annihilate them. Since the dressed operator is local, then 
some local operator (for example, the undressed operator) is expected to have non-vanishing 
matrix elements between the ground state and the desired state. Eventually we will succeed in 
creating the defect pair. 

We now try to move the defect pair by weak changes in the Hamiltonian. We add a pertur- 
bation U to the Hamiltonian with norm bounded by 1/4. Such a weak perturbation will keep 
the gap open to the states with 3 defects (of course, there are no such three defect states in the 
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toric code, but in more general models there are). So, by adding this perturbation and changing 
it adiabatically, we remain in the subspace with a defect pair. 

In a toric code Hamiltonian, every state in this subspace is a linear combination of states 
which are given by acting on a ground state ^o( s ) by a linear combination of products of strings 
of dressed operators. Consider a given state created by a single string of dressed operators, with 
endpoints i and j which are far separated. Call this state ^(S), where S is a string with 
endpoints at Note that using the property that any contr actable loop of dressed operators 
has expectation value unity in the ground state of H s , we can show that we can deform these 
strings in any way that leaves the endpoints fixed while leaving the state unchanged. Let us 
now see how the expectation value of local operators can change in the state ^(S). If O is 
an operator which is far separated from we claim that the expectation value of O in state 
ty(S) is close to its value in the ground state. To see these, note that that we can define a state 
ty(S') = ^(S) where S' is a string which stays far from O. So, without loss of generality, we 
can assume that string S is far from O. Then, the using the locality properties of the dressed 
operators, they almost commute with O, and so 

(tf(S)|0|tf(S)> = (* (s)\Oi..O\OO 1 ...O n \* (s)) (7.22) 
« {* {s)\Ol..o{o 1 ...O n O\* (8)) 
= (tf (s)|0|*o(s)>- 

So, the perturbation U can effect the state, but only if U acts close to one of the endpoints 
of the string. Further, if U is local, then one may show that U will have small matrix elements 
except between *&(S) and ty(S') for strings S' which differ from S only by small motion of one 
of the endpoints. Thus, if U is chosen to have the property that it reduces the energy when 
an end of the string is close to a certain point, the operator U can indeed by used to drag the 
defects. 

So, we have established that it is possible to create a defect pair and drag it. The gap to 
the rest of the spectrum prevents additional defects from being created. If they are created, we 
can detect them by measuring the energy, and we can drag them to destroy them and correct 
errors. 

We can drag one of the defects around the system. Since at every step we remain in a state 
which is a linear combination of states ^(S 1 ), and eventually we succeed in dragging a defect 
all the way around the system, if we are able to return to a ground state ^o( s )' then ^dl 5 ) IS 
produced by a local operation O on a state %f(S) where S is a string with two nearby endpoints 
connected by a string that goes around the sample. Any such state ^(S) is a product of dressed 
operators acting on \I/o(s). So, it is equal to ^(S) = U s Oi...O n ^o(0). Suppose v I / o(' s ) * s a ground 
state such that (*q(s), 0^>(S)} is non-negligible. This expectation value is equal to 

(*( ) (0)|(c/ s t OC/ s )Oi...O n |*o(0)). (7.23) 

However, due to the locality properties of quasi-adiabatic continuation, the operator utOU s is 
approximately local. Hence, the ground state ^d(O) of the unperturbed system is connected 
by a local operator to the state Oi...O n \^o(0)) . So, the state ^' (0) rnust be close to the state 
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formed by acting on x I / o(0) with a noncontractible string that winds around the sample. Thus, by 
dragging the defects in the perturbed system, we succeed in effecting the desired transformation 
on the ground state sector of the perturbed system. 

One thing that can go wrong in this procedure is that we might inadvertently create the 
wrong type of defect. We might accidentally create a magnetic defect pair rather than an electric 
defect pair (one may show that the perturbation U will have small amplitude to change electric 
into magnetic defects if the defects are far separated and U is local). However, we expect to be 
able to locally tell the difference between these types of defects and thus determine what kind 
of defect has been created, and, if the wrong type has been created, to destroy the defect pair 
by bringing them together. 

One would like to improve the arguments here to a full formal proof that using classical 
control it is possible to correct local errors and perform controlled operations in this perturbed 
toric code. We will leave a complete proof of this to the future, but we have established the 
existence of operators with the necessary properties and of the gaps in the spectrum. 
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